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Abstract. We show that any set containing a positive proportion of the primes 
contains a 3-term arithmetic progression. An important ingredient is a proof that the 
primes enjoy the so-called Hardy-Littlewood majorant property. We derive this by 
giving a new proof of a rather more general result of Bourgain which, because of a 
close analogy with a classical argument of Tomas and Stein from Euclidean harmonic 
analysis, might be called a restriction theorem for the primes. 



1. Introduction 

Arguably the second most famous result of Klaus Roth is his 1953 upper bound 
on r^lN), defined 17 years previously by Erdos and Turan to be the density of the largest 
set A C [N] containing no non-trivial 3-term arithmetic progression (3AP). Roth was 
the first person to show that r^^N) = o(l). In fact, he proved the following quantitative 
version of this statement. 

Proposition 1.1 (Roth). We have rs{N) ^ 1/loglogA^. □ 

There was no improvement on this bound for nearly 40 years, until Heath-Brown 
and Szemeredi [221 proved that <^ {logN)^'^ for some small positive constant c. 
Recently Bourgain jBj provided the best bound currently known. 

1 /2 

Proposition 1.2 (Bourgain). We have r^{N) <^ (log log A^/ log A^) ' . □ 

The methods of Heath-Brown, Szemeredi and Bourgain may be regarded as (highly 
non-trivial) refinements of Roth's technique. There is a feeling that Proposition 11.21 is 
close to the natural limit of this method. This is irritating, because the sequence of 
primes is not covered by these results. However it is known that the primes contain 
infinitely many 3APs.^ 

Proposition 1.3 (Van der Corput). The primes contain infinitely many 3AP5. □ 

Van der Corput's method is very similar to that used by Vinogradov to show that 
every large odd number is the sum of three primes. Let us also mention a paper of 
Balog [1^ in which it is shown that for any n there are n primes pi, . . . ,pn such that 
all of the averages ^{pi + Pj) are prime. In this paper we propose to prove a common 
generalization of the results of Roth and Van der Corput. Write V for the set of primes. 

Theorem 1.4. Every subset ofV of positive upper density contains a 3AP. 

The author is supported by a Fellowship of Trinity College, and for some of the period during 
which this work was carried out enjoyed the hospitality of Microsoft Research, Redmond WA and the 
Alfred Renyi Institute of the Hungarian Academy of Sciences, Budapest. He was supported by the 
Mathematics in Information Society project carried out by Renyi Institute, in the framework of the 
European Community's Confirming the International Role of Community Research programme. 

^In April 2004 the author and T. Tao published a preprint shoing that the primes contain arbitrarily 
long arithmetic progressions. 
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In fact, we get an explicit upper bound on the density of a SAP-free subset of the 
primes, but it is ridiculously weak. Observe that as an immediate consequence of 
Theorem 1 1.41 we obtain what might be termed a van der Waerden theorem in the primes, 
at least for progressions of length 3. That is, if one colours the primes using finitely 
many colours then one may find a monochromatic SAP. 

We have not found a written reference for the question answered by Theorem II. 4| 
but M.N. Huxley has discussed it with several people [TT)] . 

To prove Theorem 11.41 we will use a variant of the following result. This says that the 
primes enjoy what is known as the Hardy-Littlewood majorant property. 

Theorem 1.5. Suppose that p ^ 2 is a real number, and let Vn = V (1 [1, N]. Let 
{c^njnePjv ^^^2/ sequence of complex numbers with |a„| ^ 1 for all n. Then 



where the constant C{p) depends only on p. 

It is perhaps surprising to learn that such a property does not hold with any set 
A C [A^] in place of Vn- Indeed, when p is an even integer it is rather straightforward 
to check that any set does satisfy (ll.lj) (with C{p) = 1). However, there are sets for 
which ()1.1|) fails badly when p is not an even integer. For a discussion of this see PU] 
and for related matters including connections with the Kakeya problem, see fl9\ 120] . 

We will apply a variant of Theorem 11.51 for p = 5/2, when it certainly does not seem 
to be trivial. To prove it, we will establish a somewhat stronger result which we call a 
restriction theorem for primes. The reason for this is that our argument is very closely 
analogous to an argument of Tomas and Stein [21] concerning Fourier transforms of 
measures supported on spheres. 

A proof of the restriction theorem for primes was described, in a different context, 
by Bourgain [4 . Our argument, being visibly analagous to the approach of Tomas, is 
different and has more in common with §3 of |5j. This more recent paper of Bourgain 
deals with restriction phenomena of certain sets of lattice points. 

To deduce Theorem 11.41 from (a variant of) Theorem 11.51 we use a variant of the 
technique of granularization as developed by I.Z. Ruzsa and the author in a series of 
papers beginning with [SI, as well as a "statistical" version of Roth's theorem due to 
Varnavides. We will also require an argument of Marcinkiewicz and Zygmund which 
allows us to pass from the continuous setting in results such as ()1.H) - that is to say, T 
- to the discrete, namely Z/A^Z. 

Finally, we would like to remark that it is possible, indeed probable, that Roth's 
theorem in the primes is true on grounds of density alone. The best known lower bound 
on r3{N) comes from a result of Behrend j3] from 1946. 

Proposition 1.6 (Behrend). We have r^(N) ^ g-c^iogAf j^^^ some absolute constant 




(1.1) 



C. 



□ 



This may well give the correct order of magnitude for r^{N), and if anything like this 
could be proved Theorem II .41 would of course follow trivially. 
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2. Preliminaries and an outline of the argument 

Although the main results of this paper concern the primes in [N] , it turns out to be 
necessary to consider slightly more general sets. Let m ^ logiV be a positive integer 
and let 6, ^ 6 ^ m — 1, be coprime to m. We may then define a set 

A.b,m,N = {n ^ N \ nm + 6 is prime} . 

We expect Af,,m,Ar to have size about mN/cpijn) log A^, and so it is natural to define a 
function \h,m,N supported on A;, ^.Af by setting 

(f){m) \og{nm + h)/mN if G Ab,m,N 
otherwise. 



^b,m,N{n) 



For simplicity we write X = Ab^rn,N for the next few pages. We will abuse notation and 
consider \b,m,N as a measure on X. Thus for example Xb,m,N(yX), which is defined to 
be J2n ^b,m,Nij')i IS roughly 1 by the prime number theorem in arithmetic progressions. 

We use LP{dXb^rn,N) norms and also the inner product {f,g)x = S /('^)5'('^)-^6,m,Af(^) 
without further comment. 

It is convenient to use the wedge symbol for the Fourier transforms on both T and Z, 
which we define by f^{n) = f f{6)e{—n6) dO and g'^iO) = Xln respectively. 
Here, of course, e(a) = e^™. 

For any measure space Y let B(Y) denote the space of continuous functions on Y 
and define a map T : B{X) B(T) via 

T : / ^ (/^,„,;v)^ (2.1) 

The object of this section is to give a new proof of the following result, which may be 
a called a restriction theorem for primes. 

Theorem 2.1 (Bourgain). Suppose that p > 2 is a real number. Then there is a 
constant C{p) such that for all functions f : X ^ C we have 

\\Tf\\,^C{p)N-'/^fh. (2.2) 

Remember that the norm is taken with respect to the measure Xb,m,N- Theorem 12. II 
probably has most appeal when 6 = m = 1, in which case we may derive consequences 
for the primes themselves. Later on, however, we will take m to be a product of small 
primes, and so it is necessary to have the more general form of the theorem. 

We turn now to an outline of the proof of Theorem 12.11 The analogy between our 
proof and an argument by Tomas J24J, giving results of a similar nature for spheres in 
high- dimensional Euclidean spaces, is rather striking. In fact, the reader may care to 
look at the presentation of Tomas's proof in whereupon she will see that there is 
an almost exact correspondence between the two arguments. 

To begin with, the proof proceeds by the method of T and T*, a basic technique in 
functional analysis. One can check that the operator T* : B{T) — >■ B{X) is given by 

T-'-.g^g'^lx, (2.3) 

by verifying the relation 

{Tf,g)j = J {fXb,m,Nne)W)de = J]/(n)7HA.,n^,7v(n) = {f,T*g)x. 
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The equation ()2.3p explains the term restriction. Using ()2.3p we see that the operator 
TT* is the map from B{T) to itself given by 

TT* : / ^ / * AC,„,^. (2.4) 

Now Theorem 12.11 may be written, in an obvious notation, as 

WTh^P ^ C{p)N-'/P. (2.5) 

The principle of T and T*, as we will use it, states that 

\\T\\l^,= \\TT*\\,,^,= \\T%^2 (2.6) 
We would like to emphasise that there is nothing mysterious going on here - this result is 
just an elegant and convenient way of bundling together some applications of Holder's 
inequality. The proof of the part that we will need, that is to say is the inequality 
ll^llLp ^ II^^IIp'^P) is simply 

\\Tf\\, = sup {Tf,g) 

Il9llp'=l 

= sup {f,T*g) 

Il9llp'=l 

^ II/II2 sup \\T*g\\2 

Il9ilp'=l 

= sup {g,TT*gy/' 

Il9ilp'=l 

^ ll/ll2||TT*||J/!,^. 

Thus we will, for much of the paper, be concerned with showing that the operator TT* 
as given by ()2.4|1 satisfies the bound 

\\TT*\\p,^p!^C'{p)N-^/P. (2.7) 

The preceding remarks show that a proof of this will imply Theorem 12.11 To get such 
a bound one splits A into certain dyadic pieces, that is a sum 

K 

\m,N = ^^^3+ i^K+l- (2.8) 

i=i 

The slightly curious way of writing this indicates that the definition of ipK+i will be a 
little different from that of the other ipj . We will define these pieces so that they satisfy 
the L^-L°° estimates 

||/*^;^|U«,2-(i-)^||/||i (2.9) 
for some e < {p — 2)/2, and also the L^-L^ estimates 

ll/*^,"l|2«.^||/||2. (2.10) 

Applying the Riesz-Thorin interpolation theorem (see JT], Chapter 7) will then give 

ll/*v^;il,«2-^^-iv-2/^ll/ll,, 

for some positive 6 (depending on e) . Summing these estimates from j = 1 to i^T + 1 
will establish ()2.7p and hence Theorem 12.11 

To define the decomposition ()2.8|) we need yet more notation. From the outset we 
will suppose that we are trying to prove Theorem 12.11 for a particular value of p - 
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the argument is highly and essentially non-uniform in p. Write A = 4/(p — 2). Let 
1 < Q ^ (logA^)"^. If b,m,N are as before (recall that m ^ logA^) then we define a 
measure A[*^ ^ on Z by setting 



ripsSQ ~ p) ^ iin and p \ {nm + b) ^ p > Q 



otherwise. 



Define A[^^^(n) = for all n. 

As Q becomes large the measures A^*^ ^ look more and more like Xb,m,N- Much of 21 
will be devoted to making this principle precise. We will sometimes refer to the support 
of A^*^^ as the set of Q-rough numbers. 

Now let K be the smallest integer with 



and define 

for j = 1, . . . , K and define 



2^>3i^(logiV)^ (2.11) 
= aS,^ - XZ:^ (2.12) 



i'K+1 — Xb,m,N - Xf^m^Ni (2-13) 

SO that ()2.8j) holds. In the next two sections we prove the two required estimates, ()2.9j) 
and (121111. 

Let us note here that the main novelty in our proof of Theorem l2. li lies in the definition 
of the dyadic decomposition ()2.8|1 . By contrast, the analogous dyadic decompositions 
in [3] take place on the Fourier side, requiring the introduction of various smooth cutoff 
functions not specifically related to the underlying arithmetic structure. 

3. An L^-L^ estimate 

It turns out that the proof of ()2.10|) . the L^-L^ estimate, is by far the easier of the 
two estimates required. We have 

ii/*^,^ii2 = \\m\2 



Suppose first of all that 1 ^ j ^ K . Then 

1I7/; II < iia(2') II ^ iia(2'"')|| 

IIV^jlloo =^ ll^6,m,Af lloo "T ll^b,m,Af lloo 

p\m p\m 

The two products here may be estimated using Merten's formula Chapter 22]: 
This gives 

||^,||oo«j7iV, (3.1) 



J l|oo| 

'illoollJ 112- 
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and hence 

ll/*^,"l|2«^||/||2, (3.2) 

which is certainly of the requisite form ()2.10j) . For j = K + 1 we have 

< log N/N, 

so that 

ll/*^^+ll|2«^||/||2. (3.3) 

This also constitutes an estimate of the type ()2.10|) for some e < {p — 2)/2. Indeed, 
recalling our choice of A and K (viz. ()2.11|l ) one can check that 2^ ^ (log NY^'^ for 
some such e. 



4. An L^-L°° estimate 
This section is devoted to the rather lengthy task of proving estimates of the form 



Introduction. The first step towards obtaining an estimate of the form ()2.9|1 is to observe 
that 

ll/*^,"l|oo^||^,"||oo||/||l. (4.1) 

We will prove that H'i/'^lloo is not too large by proving 

Proposition 4.1. Suppose that Q ^ (logA^)"^. Then we have the estimate 

\\Km,N->^fl%\\o0<^\0gl0gQ/Q. 

The detailed proof of this fact will occupy us for several pages. Let us begin, however, 
by using 1)4.11) to see how it implies an estimate of the form ()2.9p . If 1 ^ j ^ then 
we have 

|L;,A|| _ ||\(2^)A _ \(2^"')A|| 
IIV^jlloo — \\^b,m,,N ^b,m,N Woo 

<r \\\^ - II -L II \A _ \(2'"')^|| 

^ ll^b,m,Ar ^b,m,N\\oo ' ll^b,m,7V ^b,m,N II oo 

^ logj72^-. (4.2) 

This is certainly of the form ()2.9p . The estimate for j = K + 1 is even easier, being 
immediate from Proposition 14.11 

To prove Proposition 14. II we will use the Hardy-Littlewood circle method. Thus we 
divide T into two sets, traditionally referred to as the major and minor arcs. It is perhaps 
best if we define these explicitly at the outset. Thus let p be the exponent for which we 
are trying to prove Theorem 12. II Recall that A = 4/{p — 2), and set B = 2^4 + 20. These 
numbers will be fixed throughout the proof. By Dirichlet's theorem on approximation, 
every 9 eT satisfies 

^ a (logN)^ , , 

0-- ^ ^ \' 4.3 
qN ^ ^ 
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for some q ^ A^(log A^)~'^ and some a, (a, q) = 1. The major arcs consist of those 6 for 
which q can be taken to be at most (logA^)^. We will write this collection using the 
notation 

m= [j Ma,,. 

g<(logAr)S 
{a,q)=l 

For these 6, the Fourier transforms A^'^^ and A^^ ^ depend on the distribution of 
the almost-primes and primes along arithmetic progressions with common difference at 
most (log A^)^. The minor arcs m consist of all other 6. Here different techniques apply, 
and one can conclude that both A^*^^ and A^^^ are small. The triangle inequality 
then applies. 

The ingredients are as follows. The almost-primes are eminently suited to applications 
of sieve techniques. To keep the paper as self-contained as possible, we will follow Gowers 
|H] and use arguably the simplest sieve, that due to Brun, on both the major and minor 
arcs. 

The genuine primes, on the other hand, are harder to deal with. Here we will quote 
two well-known results from the literature. The information concerning distribution 
along arithmetic progressions to small moduli comes from the prime number theorem 
of Siegel and Walfisz. 

Proposition 4.2 (Siegel- Walfisz). Suppose that q ^ (logA^)^, that {a,q) = 1 and that 
l^Ni^N2^N. Then 

J2 logp = ^' 7^' + O (iVexp(-CBV^l^)) • □ 

p=a(mod q) 

The rather strange formulation of the theorem reflects the fact that the constant Cb 
is ineffective for any B ^ 1 due to the possible existence of a Siegel zero. For more 
information, including a complete proof of Proposition 14. 2^ see Davenport's book [7J. 

The techniques for dealing with the minor arcs are associated with the names of Weyl, 
Vinogradov and Vaughan. 

The major arcs. We will have various functons / : [N] M with 

||/|U = 0(logiV/iV) (4.4) 

which are regularly distributed along arithmetic progressions in the following sense. If 
L ^ A^(log A^)"^"^""^"^ and if X C [A^] is an arithmetic progression {r,r + q, . . . ,r + 
{L - l)q} with q ^ (logA^)^ then 

E /(^) = ^ i^rM) + 0((log AT)-^)) , (4.5) 

where jr,q depends only on r and q, \'Jr,q\ ^ Q and the implied constant in the O term 
is absolute. This information is enough to get asymptotics for f^{6) when \6 — a/q\ is 
small, as we prove in the next few lemmas. 

For a residue r modulo q, write A',, for the set {n ^ A^ : n = r(modg)}. Write r 
for the function on T defined by t{6) = N^^ '^n<N^i^^)- '^^^ ^^^^ lemma deals with 
f^{e) for 1^1 ^ {log N)^/qN. 
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Lemma 4.3. Letr be a residue modulo q, suppose that \d\ ^ {log N)^ /qN, and suppose 
that the function f satisfies ()4.4|) and ()4.5|) . Then 

J2 f{n)e{en) = V,,(/)r(e) + 0(g-i(logiV)-^). 

n&Nr 

Proof. Set L = N(\ogN)~^^~^~^ and partition A^^ into arithmetic progressions (Xj)^^ 
of common difference q and length between L and 2L, where T ^ 2N/Lq. For each i 
fix an element x,- G X,-. 



5^/(n)e(^n) = /(n)e(^n) 

T T 



IX,; 



2=1 nSXi 



5^ e(^x,)^ (7.,,(/) + 0((logX)-^)) + 0{LN-\-\\ogNf^') 



i=l 



7: 



r,q 



e[t)Xi 



1=1 



N 



+ 0{q-\logN)-^) 



(4.6) 



However 



i=l 



T T 
i=l n£Xi i=l n£Xi 

= J2 ^(^^) + ^(^^-^(logX)^). 

Finally, observe that ifO^r, s^g — 1 then 

J2 <dn) - J2 = 0((logX)^), 



(4.7) 



n£Nr 



neNs 



and so 



n&Nr 



0{N-\logNf). 



Combining this with ()4.6p and ()4.7p completes the proof of the lemma. 

We may now get an asymptotic for f^{6) when 6 is in the neighbourhood of a/q. 



□ 



Lemma 4.4. Suppose that f satisfies the conditions ()4.4|) and ()4.5p and that 9 G 
for some a, q with (a, q) = 1 and q ^ (logX)-^. Write 



Then we have 



= q-'a^MMO - a/q) + 0((logX)-^). 



(4.8) 



(4.9) 
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Proof. Write (3 = 9 — a/ q. Then 

f\0) = 5^/(n)e(H 



"' ^ ' '-in 



^ e{ar/q) f{n)e{[3i 

r(modq) ni^Nr 

= q-'m E e(ar/g)7.,,(/) + 0((logiV)-^) 

r(mod q) 

= g-V,,,(/)r(/?) + 0((logiV)-^). 
This concludes the proof of the lemma. □ 

To apply these lemmas, we need to show that / = jy and / = Xb,m,N satisfy ()4.4j) 
and ()4.5|) for suitable choices of '~ir,q{.f)- We will then evaluate the sums aa,q{.f)- This 
slightly tedious business is the subject of our next four lemmas. 



Lemma 4.5. / = Xb,m,N satisfies ()4.4j) and ()4.5|1 with 

7r,g(/) = 



(j){m)q/ (j){mq) if {mr + b, mq) 
otherwise. 



Proof. This is a fairly immediate consequence of the Siegel-Walfisz Theorem (Propo- 
sition Let X = {r,r + q, . . . ,r + [L — l)q} be any progression contained in [A^] 
with common difference q ^ (logA^)^ and length L ^ A^(log A^)^^^^'^^^. An element 
r + jg G X lies in ,„,iv precisely if (mr + b) + jmq is prime, so the lemma is trivially 
true unless (mr + b,mq) = 1. Supposing this to be the case, we may use Proposition 
14.21 Recalling that m ^ log A^, one has 

Ab.m,7v(-^) = —, + O fmgexp(-CB+i V'logmgA^)j 



(j){mq)N 

An 

+ ((log AT) 



L ( (j){m)q ^ 



A^ \(j){mq) 

as required. □ 
Lemma 4.6. / = A[^ satisfies (jOj) and IK^i with 

( Up^Q (l - I) Up<Q (l - I) if {rnr + b, mq) is Q-rough 

lr,q[f) = < rArn ^ ' ■p\mq ^ ^ 

y otherwise. 

Proof. Consider an arithmetic progression X = {r, r + q, . . . ,r + {L — l)q}. Let pi, . . . ,pk 
be the primes with p ^ Q and p \ m. If (mr + b, mq) is not Q-rough then pi \ {mr + b, mq) 
for some i, and the second alternative of the lemma clearly holds. Suppose then that 
(mr + b,mq) is Q-rough. We will apply the Brun sieve to estimate A[*^^(X). 

Let X G X be chosen uniformly at random, and for each i let Xj be the event 
Pi\{mx + b). Since Pi \ {mr + b, mq), the probability of Xj is ei/pi + 0{L~^), where = 
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if Pi I g and = 1 otherwise. Now we have 

say. By the inclusion-exclusion formula it follows that for every positive integer t 

U = t(-'r E tl%/P:+0(L-')±('l). (4.11) 

s=0 l^ii<---<is^k j=l s=l ^ ^ 

It is helpful to have the error term here in a more usable form. To this end, observe 
that it is certainly at most 0{k^/L). We wish to replace the main term in 1)4.111) by 
HiLi (1 — ej/Pi), which is equal to the completed sum 

Doing this introduces an error 

s=t+l l^ii<---<is^k j=l 

which is bounded above by 

s=t+l ' \i=l 

By another result of Mertens one has Y1'^=iP7^ ^ log log Q + 0(1). Hence if t ^ 
3 log log Q then each term in ()4.12)1 is at most one half the previous one, leading to the 
bound 

^ 2 (log log g)* ^ Aeloglogg 
' ' ^ t\ ^ V ^ 
Combining all of this gives 

k 

U = 1[{1~ e,/Pi) + 0{k'/L) + O ((4eloglogg/t)*) . 
1=1 

Using the trivial bound k ^ Q, and choosing t = log A^/ 2 A log log A^, one gets 

k 

U = l[{l-e,/p.) + 0{N-'/'^) 

i=l 

n(i-i)+o(iv-/"). 



p\mq 
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The lemma is immediate from this and ()4.10j) : we have 

/ \ 



^b,m,N 



P^Q 

p\'m \r)\mq I 

= ^br,, + 0{{\0gN)-^)), 

where 'jr,q has the form claimed. □ 

Building on the last lemma, the next lemma gives an evaluation of o"a,g(A[*^^) and 

an asymptotic for A[^^^(6') when 6 G DJla,q. If Q ^ 2 we say that a positive integer is 
Q-smooth if all of its prime divisors are at most Q. We declare there to be no 1-smooth 
numbers. 



Qf^il) ^ f j ^y. q) = 1 and q is Q-smooth; 



Lemma 4.7. Suppose that {a,q) = 1. Then 

I otherwise, 
where m is the inverse of m modulo q. If 9 E 07la,g then 



ji{q) ( abm 



a 



Hqf l"T^ OiilogNy^) ^f (m, q) = 1 



e{ar/q) 



and q is Q-smooth; 
0((logA^)^^) otherwise. 

Proof. Recall the definition ()4.8j) of aa,q, and also Lemma (4. (j[ We shall prove that 

e{—abTn/q)fi{q) if {m,q) = 1 and q is Q-smooth 
, , ,0 otherwise. 

r{modq) \ 
{mr+b,mq) is Q-rough 

(4.13) 

Now if p\m then p can never divide mr + b, because we are assuming that (m, b) = 1. 
Let go be the largest factor of q which is a product of primes p with p ^ Q and p \ m. 
Then the sum fl4.13|l is just 

Yl (4-14) 

r(mod(j) 
{qo,mr+b)=l 

Set qi = q/qo and write, for each r mod q, r = kq^ + s where ^ A; ^ gi — 1 and s is a 
residue mod go- Then the sum fl4.14|) is 

s(mod(jo) k=Q ^ ^ s(modijo) k=0 

{qo,ms+h)=l {qo,mr+b)=l 

Now a is coprime to g and hence to gi, and therefore the rightmost sum here vanishes 
unless gi = L This is the case precisely if go = g, which means that (g, m) = 1 and g is 
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Q-smooth. In this case, the sum is 

Yl e(as/g). 

s(mod(j) 
{q,ms+b)=l 

Set t = ms + b. Then this sum is just 

Ef arnit — h)\ , , , , \ ^ 

e I I = e[—abm/q) y, elamt/q) 

t(modg) \ 1 / (g,t)=l 

(g,t)=i 

= e(—abfn/q)fi{q). 

This last evaluation, of what is known as a Ramanujan Sum, is well-known and is 
contained, for example, in This proves ()4.13|) . 

Now to obtain o"a^g we must simply multiply ()4.13|) by the factor 

p\m p\mq 

appearing in Lemma 14 .61 One gets zero unless (m, q) = 1 and q is Q-smooth, in which 
case it is not hard to see that F = q/(j){q). This completes the evaluation of o'a,q{^^^m tv); 
and the claimed form for A['^^(6') is an immediate consequence of Lemma f4. 41 □ 



We need a version of the above lemma in which A^*^^ is replaced by Xb,m,N- For- 
tunately, we can save ourselves some work by noticing that for fixed q and m we have 

7r,g(Ab,m,Jv) = 7r,q(A[2,^) (4.15) 

for sufficiently^ large Q. Thus <Ja,q{K,m,N) can be evaluated by simply letting Q ^ oo 
in the first formula of Lemma 14.71 We get 

( qMe{-abm/q) if(g,m) = l 

(ra,q{Km,N) = < ^, . (4.16) 

I U otherwise. 
This immediately leads, via Lemma f4.4| to the following evaluation of A^^jy(6'). 
Lemma 4.8. Suppose that {a,q) = 1 and that 9 G 07la,g- then 

,(^^) = <^ 0(g) \ q J \ qj W 6 ; ; ^ v ^^^^^^ 



O ((logA^) ^) otherwise. 



The minor arcs. In this subsection we look at A^^^(6') and A['^^(6') when 6 is not 
close to a rational with small denominator. 

Lemma 4.9. Suppose that a, q are positive integers with (a, q) = 1, and let 6 be a real 
number such that \6 — a/q\ ^ 1/q'^. Then 

KmAd) < (logiV)lO (g-1/2 + JV~l/5 ^ iV-V2gl/2) . (4.I8) 

^Here we regard "fr,qi^b.m,N) and lr,qi^i^2. n) purely formal expressions, so there is no issue of 
whether or not, for example, Lemma l4.7l is valid for "sufficiently large" Q. 
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Thus if 9 em then X^^^nA^) = 0{(\ogNy^). 

Remarks. This is a well-known estimate, at least when b = m = 1. The first (un- 
conditional) results of this type were obtained by I.M. Vinogradov, and nowadays it is 
possible to give a rather clean argument thanks to the identity of Vaughan [211 • Chapter 
24 of Davenport's book j7j describes the use of Vaughan's identity in the more general 
context of the estimation of sums ^„<Ar A(n)/(?7,). To obtain Lemma lOl we used this 
approach, but could afford to obtain results which are rather non-uniform in m due 
to the restriction m ^ logA^ under which we are operating. Details may be found in 
the supplementary document ^21- We remark that existing results in the literature 
concerning minor arcs estimates for primes restricted to arithmetic progressions, such 
as |21IIZ|, strive for a much better dependence on the parameter m. □ 

Lemma 4.10. Suppose that a, q are positive integers with (a, q) = 1, and let 6 be a real 
number such that \9 — a/q\ ^ 1/q'^- Then 

aSOv(^) « (logiv)' {q-' + qN-' + iv-V8A) _ ^4 

Thus if 6 em then \fi%{e) = 0{{logN)~^). 

Proof. Let pi, . . . ,pk be the primes less than or equal to Q which do not divide m. 
Another application of the inclusion-exclusion principle gives 

Ag^v(^) = N-M-be/m) n f 1 - -) ' HO), 

i=i \ 

where 

w = i:(-ir E E ^ 

■5=0 l^ii<---<is^k l^y^Nm/pi-^...pi^ 

y=b{modm) 

Summing the geometric progression, one sees that the inner sum is no more than 

min{||6'pi, . . .piJ\~'^,2mN/pi^ . . .p^J . 

We will split the sum over s in ()4.2()j) into two pieces, over the ranges s e [0, t] and 
s e (t, k] where t = log A^/2y41oglog A^. Each of the primes pi is at most Q ^ (log A^)"^, 
so the product of any s ^ t oi them is no more than y/N. Of course, all such products 
are distinct and so 

E(-l)^ E E ef ^P'^-^P'^y ) ^ E ^M\\0n\\-\2mN/n). 

s=0 l^ii<-<is^ky^Nm/pij^...p,^ ^ ^ n^v^ 

y=b{mod m) 

This is a quantity whose estimation is standard in this area because of its perti- 
nence to the estimation of exponential sums on minor arcs. It is bounded above by 
C(log Ar)3(ArV2 + g + Nq-^); details may once again be found in [T^ . 



OPh ■■■PisV 



m 



(4.20) 
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On the other hand 



E 



s=t+l l^ii<---<is^fc y^Nm/pi-^ ...pi^ 

y=b{mod m) 



^ 2mNY: E IK 

s=t+l l^ii<---<is^k j=l 



GPh ■■■PisV 



m 



^ 2mN + 



s=t+l 

^ AmN{2e\og\og\og N/ty ^ mA^^^^/^^ ^ a^i-i/8A_ 

Since n^=i(l ~ ^/Pi)^^ ^ ^ogN, the claimed bound follows. □ 

Proof of Proposition \4.1\ Suppose first of all that 9 G Tla,q for some a, q, and recall 
Lemmas 14.71 and 14.81 If q is Q-smooth then 

Km,N{(^) - aSOv(^)| = 0{N{\ogN)-^). 
If q is not Q-smooth then q > Q and so we get 



^ \'^b,m,N\ 



.WI + |AgOvWI 

< 2/0(g) + O((logiV)"^) 

^ 41oglogg/g + 0((logiV)-^), 

the last estimate being contained in ^3], Chapter 17. Since we are assuming that 
Q ^ (logA^)^ this expression is 0(\og\og Q/Q). If, on the other hand, 9 & m then we 
have 



, |\(Q)A 



^b,rn,N 



= 0((logiV)-^) 

This at last completes the proof of Proposition 14.11 

5. Restriction and majorant estimates for primes 
In this section we prove Theorems 11.51 and 12.11 

We have already seen, in ()4.ip and ()4.2|) . how Proposition 14.11 implies an L^-L 
estimate for the operator f \—>- f * tpj of the form p.9j) . In fact, we have 

logj 



□ 



\\f*H 



2^ 



(5.1) 



for all j = 1, . . . , i^' + 1. For each fixed j = 1, . . . ,K, one can use the Riesz-Thorin 
interpolation theorem to interpolate between ()3.2j) and 1)5.11) . This theorem, which is 
discussed in |1H Chapter 7] , is better known to analytic number theorists as the type of 
convexity principle that underpins many basic estimates on ( and L-functions. It gives 



ll/*^illp < j2/P(log^-)l-2/P2-(l-2/p)iiV-2/P 



(5.2) 
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For j = K + 1 another interpolation, now between ()3.3|) and (jS-H) . instead gives 

« (logiV)2/^'(logir)i-2/P2-(i"2Mi.. 
Recalling at this point the definition ()2.1H) of K we see that this implies 

||/*Wl|p«(logiV)-VPiV-2/p. 
Summing this together with ()5.2j) for j = 1, . . . ,K gives, because of the decomposition 

(EH), 



\\f*\m,N\\p^CiP)N 



-2/p 



p.. 



As we have already remarked. Theorem 12 . II follows by the principle of T and T*. □ 

Now we prove Theorem II .51 Although we will need a slightly different result later on, 
this theorem seems to be the most elegant way to state the majorant property for the 
primes. 

Proof of Theorem \1.3. Let {an)n&^M ^'^Y sequence of complex numbers with |a„| ^ 1 
for all n. We apply Theorem 12. II to the function / defined by f{n) = an/ logn. Writing 
out the conclusion of Theorem 12.11 gives, for any p > 2, 

/(n) log ne(n6') 



Therefore 



ane[nt 



de «, m/'-' Yl 



p/2 



\neVN 



logn 



However it is an easy matter to check that 



E 



einb 



de > 



\^l/2N 



de > NP-^{\ogN)-P. 



This proves Theorem 11.51 for p > 2. For p = 2 it is trivial using Parseval's identity. □ 

6. Roth's theorem in the primes 

Let Aq be a subset of the primes with positive relative upper density. By this we 
mean that there is a positive constant ao such that, for infinitely many integers n, we 
have 

\AnVn\^ aon/logn. (6.1) 

This is not a particularly convenient statement to work with, and our first lemma derives 
something more useful from it. 

Lemma 6.1. Suppose that there is a set Aq (1 V with positive relative density, but 
which contains no 3APs. Then there is a positive real number a and infinitely many 
primes N for which the following is true. There is a set A C {1, . . . , [A^/2J}, and an 
integer W E log log A^, ^ log log A^] such that 

• A contains no 3APs 

• K,m,N{A) ^ a for some b with {b,m) = 1, where m = Y[p<wP- 
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Proof. Take any n ^ for which holds. Let W = [^loglognj, and set m = 
Y[p<wP- Choose N to be any prime in the range (2n/m, 4n/m]. Now there are certainly 
no more than m elements of Aq which share a factor with m, and no more than n^^^ 
elements x G Aq with x ^ n^^'^. Thus 

^ ^ Aoix)\ogx ^ aon/2, 

b:{b,m)=l x!^n 

x=b{modm) 

and for some choice of b we have 

Ao{x)\ogx ^ aon/2(f){m). (6.2) 

x=b{modm) 

Write A = {{Aq fl [n]) — h). This set, being a part of Aq subjected to a linear trans- 
formation, contains no 3-term AP. It is also clear that A C {1, . . . , [A^/2j }. Furthermore 
()6.2|1 is equivalent to 

A{x) \og{mx + b) ^ aon/2(f){m)^ 

xfN 
mx+b is prime 

which implies that Xb,m,N{A) ^ aQn/2mN ^ ao/S. The lemma follows, with a = oq/S. 
□ The reason we stipulate that A be contained in {1, . . . , [A^/2J} is that A does not 

contain any 3APs when considered as a subset of Z^r = Z/A^Z. This allows us to make 
us of Fourier analysis on Zat. If / : Za? ^ C is a function we will write, for any r G Zat, 

f{r) = Yl f{x)e{-rx/N). 

Observe that / may also be considered as a function on Z via the embedding Zjv ^ [N] , 
and then f{r) = f^{r/N). 

For notational simplicity write /i = Xb,m,N- We will consider A and as functions on 
Zjv. Write a = Afi. We will continue to abuse notation by using fi and a as measures. 
Thus, for example, a{Ziy) ^ a. 

Now if A contains no (non-trivial) 3APs then 



'^^a{x)a{x + d)a{x + 2d) = ^^a(x) 

x,d X 

^ $^/i(x) 



^ (logA^)VA^2. (6.3) 

We are going to show that this forces a to be small. We will do this by constructing 
a new measure Oi on Z^r which is set-like, which means that Oi behaves a bit like A^^^ 
times the characteristic function of a set of size ~ aA^. The new measure ai will be 
fairly closely related to a, and in fact we will be able to show that 

ai(a;)ai(x + (i)ai(x + 2(i) is small. (6.4) 

xA 
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This, it turns out, is impossible; an argument of Varnavides based on Roth's theorem 
tells us that a dense subset of Z^v contains lots of 3APs. We will adapt his argument 
in a trivial way to show that the same is true of set-like measures. 

The arguments of this section, then, fall into two parts. First of all we must define 
ai, define the notion of "set-like" and then show that oi is indeed set- like. The key 
ingredient here is Lemma ffi.2| which says that Jl is small away from zero. Secondly, we 
must formulate and prove a result of the form ()fj.4|l . For this we need Theorem 12.11 the 
restriction theorem for primes. 

The idea of constructing ai, and the technique for constructing it, has its origins in 
the notions of granularization as used in a paper of I.Z. Ruzsa and the author 9J. In 
the present context things look rather different however and, in the absence of anything 
which might be called a "grain" , we think the terminology of jHI no longer appropriate. 

Let us proceed to the definition of ai. Let 5 G (0, 1) be a real number to be chosen 
later, and set 

R = {reZN: \a{r)\ ^ 5}. 

Let k = \R\, and write R = {ri, . . . ,rk}. Let e G (0, 1) be another real number to be 
chosen later, and write B{R, e) for the Bohr neighbourhood 

{xGZ^: ll^ll ^eV^G [fc]}. 

Write B = B{R,e) and set P{x) = B{x)/\B\. Define 

ai = a* (3 * (3. (6.5) 

It is easy to see that 

ai(ZAr) ^ a. (6.6) 

In Lemma [6 . 31 below we will show that ||ai||oo ^ 2/iV, provided that a certain inequality 
between e, k and W is satisfied. This is what we mean by the statement that ai is 
set- like. 

Lemma 6.2. Suppose that N, and hence W , is sufficiently large. We have 

sup |/I(r)| ^ 2\og\ogW/W. 

Proof. Recall that Jl{r) = fi^{r/N). There are three different cases to consider. 

Case 1. r/N G 2Jto,i, that is to say \r/N\ ^ (log A^)'^/A^. Then by Lemma f4. 81 we have 
the asymptotic 

Jl{r) = T{r/N) + O(logiV)-^. 
Observe, however, that T{r/N) = provided that r 7^ 0. 
Case 2. r/N ^ ^a,q- Then Lemma f4.8l gives 

J^ir) = f-^l r(^-^-)+ O(logiV)-^ 



V Q J \N q 
where 

1 (g, m) = 1 



Xq 



otherwise. 
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Since m = np<vK^'' certainly have = for g ^ W. Thus indeed 
\Jl{r)\ ^ sup (f){n)-^ + {log N)-^ ^ 2\og\ogW/W. 

Case 3. r/N G m. Then Lemma Ol gives Jl{r) = fi^{r/N) = 0((log A^)~^). □ 

Lemma 6.3. Suppose that ^ 2\og\ogW /W . Then the measure ai is set-like, in the 
sense that we have ||ai||oo ^ 2/N . 

Proof. Indeed 

ai(x) = a* (3 * P{x) 
^ fi* P * (3{x) 

r 

^ N-'mm'+N-'j2\^^{rmr)\' 

^ N^^ + N-'^ sup \Jl{r) I V |/3(r) |2 

= N-^ + \B\~^ sup \Jl{r)\ 

1 2 log log 

W\B\ 

Now by a well-known application of the pigeonhole principle we have \B\ ^ e^N , from 
which the lemma follows immediately. □ 

We move on now to the second part of our programme, which is a statement and 
proof of a result of the form ()fi.4|l . 

Proposition 6.4. We have 

Y,<x)a,{x + d)a,{x + 2d) <: CN-^'^ + ^ {2^hH-'''^ + C6^'^) . 

x,d 

We will require several lemmas. The most important is a "discrete majorant prop- 
erty" . Before we state and prove this, we give an elegant argument of Marcinkiewicz 
and Zygmund [23 • We outline the argument here since we like it and, possibly, it is not 
particularly well-known. 

Lemma 6.5 (Marcinkiewicz-Zygmund). Let N he a positive integer, and let f : [N] — >■ 
C be any function. Consider f also as a function on Z]^. Let p > 1 be a real number. 
Then we have 

E l/WI' = E \f"(^W ^ c{p)N / \f{e)\^de. 

Proof. Consider the function 
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This function is equal to 1 for all n with |?t,| ^ N . Its Fourier transform, g^{6), is equal 
to 2K2n{0) — Kn{9), a difference of two Fejer kernels. Thus we have 



N 



and so 

l7(r)r = \f\r/N)\^ 



f\e) i2KMr/N -9)- KN{r/N - 9)) d9 



< 3^"^ ( 2P 



r{9)K2N{rlN~9)d9 



f''{9)KN{r/N -9) d9 



^ 3P-1 2P / \f\9)\^K2N{r/N -9)d9+ / \r{9)\^Kj,{r/N 



'd9 



by two applications of Jensen's inequality. It is necessary, of course, to use the fact that 
the Fejer kernels are non-negative. To conclude the proof, one only has to show that 

N-l 



^KNir/N -9) ^ CN, 



r=0 



together with a similar inequality for K2n- But this is a straightforward matter using 
the bound 



N-l 



N-l 



. _ j.^ r .7 .7 + 1 1 



r=0 



j=0 06lir,^ 



together with the estimate 



valid for |0| ^ 1/2. 



□ 



Lemma 6.6 (Discrete majorant property). Suppose that p > 2. Then there is an 
absolute constant C{p) {not depending on a) such that 

J2\a{r)\'^Cip). 

r 

Proof. A direct application of Theorem 12.11 gives 

\a''{9)\Pd9 ^ C'{p)N-\ 

□ 

^ 2iV. 



The lemma is immediate from this and Lemma 16.51 



Lemma 6.7. Suppose that r G R. Then 



I- Pirypi-2ry 
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Proof. We have 



1 - (3{r] 



1 
1 



^ (1 - e{rx/N)) 



^ (1 - cos(27rra;/iV)) 



xeB 

2 / Ani2 



^ 47r''sup ||ra;/iV|| 
xeB 

A very similar calculation shows that 

1 - P{-2r) ^ 64e2, 

and the lemma follows quickly. □ 

Proof of Proposition \6.4\ By (16 .HI) we have, observing that oi = 
''^^ai{x)ai{x + d)ai{x + 2d) ^ ai{x)ai{x + d)ai(x + 2d) 

- a{x)a{x + d)a{x + 2d) + (log N)^N-'^ 

= 0(iV-3/2) 

^a(r)2a(-2r) (l - /3(r)^/3(-2r)2) . (6.7) 

r 

Split the sum in ()6.7j) into two parts, that over r & R and that over r ^ R. When r & R 
we use Lemma IfTTI to get 



^a(r)2a(-2r) (l-/?(r)V(-2r)2) ^ 2^V|i?| 



this last inequality following from Lemma f6.6l with p = 5/2. To estimate the sum over 
r ^ R, we again use Lemma [6. 61 with p = 5/2. Indeed using Holder's inequality we have 



J2a{r)^a{-2r) (^1 - /5(r)^/3(-2r)^ 



This concludes the proof of Proposition 16.41 



^ 2 sup I a(r ) 1 V|S(r) 1 



□ 



By fl6.6j) and Lemma WT^ a\ behaves a bit like a measure associated to a set of size 
aN . As promised, we use this information together with an argument originally due to 
Varnavides j2H] to get a lower bound on ^ai(x)ai(x + (i)a\[x + 2di). 

Lemma 6.8. For some absolute constant C2 we have 

ai{x)ai{x + d)ai{x + 2d) ^ exp (-C2a;"2 log(l/a)) A^~^ 
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Proof. Let A' = {x E Z^v : ai{x) ^ a/2N}. By Lemma f6. 31 we have 

E2\A'\ a , 
a,ix) ^ ^ + —\A% 

which imphes that \A'\ aN/A. We will give a lower bound for Z, the number of 3APs 
in A'. It is clear that ^ai(x)ai(x + d)ai{x + 2(i) is at least a^Z/8N^. 
Now by Bourgain's theorem^ [Hj there is a constant Ci such that if 

M ^ exp log(l/a)) 

then any subset of {1, . . . ,M} of density at least a/8 contains a SAP with non-zero 
common difference. Now there are exactly A^(A^ — 1) non-trivial arithmetic progressions 
of length M in Zat, and A' will have density at least a/8 on many of them. To estimate 
exactly how many, fix a common difference d 0, and let I = {0, d, 2d, . . . , (M — l)d}. 
We have Ylx ^ * -^(■^) ^ aNM/A, but A' * I{x) ^ M for every x. Thus another simple 
averaging argument shows that A' * I{x) ^ aM/8 for at least aN/8 values of x. 

In total, then, there are at least aN'^/8 progressions of length M on which A' has 
density at least a/8. Each of them contains a SAP consisting of elements of A'. No 
SAP thus counted can arise from more than progressions of length M. Thus we 
have two different ways of bounding Z, and putting them together gives 

Z ^ aN^/8M^. 

The lemma follows. □ 

Combining this with Proposition 16. 4[ we get 

C'N-'/^ + 212^25-5/2 + c6'/2 ^ (_C2a-2 log(l/a)) . (6.8) 
There are constants C3, C4 so that if we choose 

6 = exp (— Csa"^ log(l/a)) 

and 

e = exp (— C4a~2 log(l/a)) 

then ()6.8|1 cannot hold, and we will have derived a contradiction to the assumption 
that A contains no SAPs. We are permitted to choose any values of e and 6 so that 
the condition of Lemma 16.81 is satisfied. Recalling that k ^ C5~^^'^ (a consequence of 
Lemma inini) and that W ^ loglogA^/8, we see that ()6.8p can indeed be contradicted 
provided that 

a ^ C.lp^. (6.9) 

The subscripts indicate the number of iterated logarithms, not the base to which those 
logarithms are taken! 

Let us remind the reader of what it is that we have contradicted. We assumed that 
there was a subset Aq <Z V of positive relative upper density, containing no SAP. The 
number a was related to the relative upper density of Aq, via the slightly technical 
reductions made in Lemma (6. II A bound of the form ()6.9|1 also holds for ao- That is, 
any subset of P„ with cardinality at least Cn(log5 nY^"^/ log r;,(log4 n)^/^ contains a SAP. 



■^We could equally well use Roth's original theorem here, at the expense of making any bounds for 
the relative density in Theorem 1 1 . 41 even worse. 
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By far the most important reason for us getting such a poor bound was the need to 
prove Lemma f6. 21 which says that by passing to a subprogression of common difference 
m = Y[p<wP make the primes look somewhat uniform. This is a rather crude 

trick but we have not been able to get around it. Even if we could, the resultant bounds 
would surely be many miles from the probable truth, which is that any subset of [A^] of 
cardinality N(\ogN)~^^^^ contains 3APs. 

Let us conclude by remarking that the methods of this section use rather little about 
the primes. In fact by the same argument one could establish a Roth-type theorem 
relative to any measure fi : Zat IR+ for which one had good control on sup^_^o l/^l'")! 
together with bounds for for some p G (2,3) and any / satisfying ^ /(x) ^ 

IJ,{x) pointwise. In practise bounds of this latter type will come by restriction theory 
arguments of the type given in ^ A more general setting for our arguments, along the 
lines just described, is given in jTHj. 
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